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Abstract 

We study ordinal embedding relaxations in the realm of parameterized complex- 
ity. We prove the existence of a quadratic kernel for the Betweenness problem 
parameterized above tight lower bound, which is stated as follows. For a set V 
of variables and set C of constraints "u, is between Vj and Vk" , decide whether 
there is a bijection from V to the set {1, . . . , |V|} satisfying at least \C\/3 + k 
of the constraints in C. Our result solves an open problem attributed to Benny 
Chor in Niedermeier's monograph "Invitation to Fixed-Parameter Algorithms." 
An approach developed in this paper can be used to determine parameterized 
complexity of a number of other optimization problems on permutations param- 
eterized above or below tight bounds. 

Key words: Fixed-parameter Tractability, Parameterized Complexity, Kernel, 
Probabilistic Method 



1. Introduction 

The problem of mapping points with measured pairwise distances into a 
target metric space has a long history and been studied extensively from mul- 
tiple perspectives due to its numerous applications. The quality of such an 
embedding can be measured with various objectives; for example isometric em- 
beddings preserve all distances while aiming at low-dimensional target spaces. 
Yet, for many contexts in nearest-neighbor search, visualization, clustering and 
compression it is the order of distances rather than the distances themselves 
that captures the relevant information. The study of such ordinal embeddinqs 
dates back to the 1950's and has recently witnessed a surge in interest [H, H, E3]. 
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In an ordinal embedding the relative order between pairs of distances must be 
preserved as much as possible, i.e., one minimizes the relaxation of an ordinal 
embedding defined as the maximum ratio between two distances whose relative 
order is inverted by the embedding. 

Here we study the one-dimensional ordinal embedding of partial orders that 
specify the maximum edge for some triangles. This problem has been studied 
under the name of Betweenness (see the first problem in A12 of [3), which 
takes a set V of variables and a set C of betweenness constraints of the form 
"vi is between Vj and Vk" for distinct variables Vi,Vj,Vk G V. Such a constraint 
will be written as (vi, {vj, Wfc}). The objective is to find a bijection a from V to 
the set {1, . . . , \V\} that "satisfies" the maximum number of constraints from C, 
where a constraint (i>j, {vj, Vk}) is satisfied by a if either a(vj) < a(vi) < a(vk) 
or a(ffc) < a{vi) < ot(vj) holds. We also refer to a as a linear arrangement 
ofV. 

Such linear arrangements are of significant interest in molecular biology, 
where for example markers on a chromosome need to be totally ordered as to 



satisfy the maximum number of constraints ll|, [15| . More theoretical interest 
comes from the constraint programming framework with unbounded domains 
and interval graph recognition [21| . 

Despite its simple formulation, the Betweenness problem has a challeng- 
ing computational complexity. Already deciding if all constraints can be sat- 
isfied by some linear arrangement is an NP-complete problem (2(|. Hence, the 
maximization problem is NP-hard. On the other hand, any uniformly random 
permutation of the variables satisfies at least one-third of all constraints, and 
this fraction is tight. Better approximation ratios are hard to achieve: the frac- 
tion of one-third is best-possible under the Unique Games Conjecture 0], and 
it is NP-hard to find a linear arrangement that satisfies a 1 — e fraction of the 
constraints for some e S (0, 1/48) [10(. The mere positive result is a polynomial 
time algorithm that either determines that there is no linear arrangement that 
satisfies all m constraints or finds a linear arrangement satisfying at least half 
of them 



A parameterized problem is a subset L C E* x N over a finite alphabet S. 
L is fixed-parameter tractable if the membership of an instance (x, re) in E* x N 
can be decided in time xj ^) • /(re) where / is a computable function of the 
parameter 

0, 

[T3L Eij ], If the nonparameterized version of L (where re is just 
a part of input) is NP-hard, then the function /(re) must be superpolynomial 
provided P ^ NP. Often /(re) is "moderately exponential," which makes the 
problem practically feasible for small values of k. Thus, it is important to 
parameterize a problem in such a way that the instances with small values of re 
are of real interest. 

One can parameterize the Betweenness problem in the standard way by 
asking whether there exists a linear arrangement that satisfies at least k of the 
constraints, where n is the parameter. This parameterized problem is trivially 
fixed-parameter tractable for the following simple reason. A uniformly random 
permutation of the variables in V satisfies one-third of the constraints in expec- 
tation; thus if re < \C\/3 then the answer is "yes" whereas if re > |C|/3 then the 
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instance size is bounded by a function of k and we can solve the problem by 
brute-force. Note that the above standard parameterization is of little value as 
the parameter k will often be large. Thus, it makes sense to consider the follow- 
ing natural parameterization above a tight lower bound of the Betweenness 
problem [25j ]. 

Observe that |C|/3 is a lower bound on any optimal solution. On the other 
hand, for a set C of constraints containing all three possible constraints on each 
3-set of variables, no more than |C|/3 of the constraints in C can be satisfied 
in any linear arrangement. Hence the lower bound of one-third on the fraction 
of satisfiable constraints is tight, in the sense that it is attained by an infinite 
family of instances. So the right question to ask is whether there exists a 
linear arrangement that satisfies at least C|/3 + k of the constraints. The 
parameterized complexity of this problem attributed to Benny Chor is open, and 
was stated as such in [25|. Since the Betweenness problem is NP-complete, 
the complementary question of whether all but k constraints are satisfiable by 
some linear arrangement is not fixed-parameter tractable, unless P = NP. For 
the special case of a dense set of constraints, containing a constraint for each 
3-subset of variables, a subexponential fixed-parameter algorithm was recently 
obtained [J- 

Parameterized complexity of maximization problems parameterized above 
tight lower bounds often tends to be quite a challenging question, with only a 
handful of them answered. A prominent example is to decide whether a planar 
graph G contains an independent set of size |V^(G)|/4 + k; it is unknown whether 
this problem is fixed-parameter tractable or not. Further addressed were above- 
tight-lower-bound parameterizations of Max-Cut, Max-Sat, Max-t-Sat, and 
Linear Ordering [H, [2, EH. Interestingly, all these problems were shown to 
be fixed-parameter tractable. Many open problems in that area are listed in 
[23}; in particular, the parameterized complexity of Max Lin-2 par ameterized 
above a tight lower bound remains unknown despite an effort in [181 ] . 

In this paper we settle Benny Chor's question [25|, p. 43] about the param- 
eterized complexity of the following problem: 

Betweenness Above Tight Lower Bound (BATLB) 

Instance: a set C of betweenness constraints over variables V and 
an integer n > 0. 

Parameter: The integer n. 

Question: Is there a bijection a : V — + {1,...,|F|} that satisfies 
at least |C|/3 + k constraints from C, that is, for at least |C|/3 + k 
constraints (vi, {vj, u/.}) G C we have either a(vj) < a{vi) < a{vk) 
or a(v k ) < a(vi) < a(vj)7 

Our main result is that BATLB is fixed-parameter tractable. Moreover, we 
show that BATLB has a problem kernel of quadratic size, namely, any instance 
is polynomial-time reducible to an equivalent instance of size 0(k 2 ). (We give 
a formal definition of a problem kernel in the next paragraph.) The kernel 
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is obtained via a nontrivial extension of the recently introduced probabilistic 
Strictly Above/Below Expectation Method (SABEM) [ijj], which shows fixed- 
parameter tractability of Linear Ordering and three special cases of Max 
Lin-2 parameterized above tight lower bounds. After the first version of this 
paper was completed (that version was not much different from the current 
one, see |http : //arxiv . org / abs/ 090 7 . 5427vl] >, another paper Q that applies 
SABEM was written. Alon et al. further developed SABEM to prove fixed- 
parameter tractability of Max r-SAT parameterized above a tight lower bound, 
but a simple modification of SABEM in 2] is not applicable to BATLB, sec 
Section [3 

We describe SABEM briefly in Section [5] and point out how to extend it in 
order to obtain a quadratic kernel for BATLB. The necessity to extend SABEM 
lies with the fact that a feasible solution to BATLB is a permutation of vari- 
ables; see Section [2] for a "high-level" discussion and Section [3] for details. Note 
that our extension of SABEM can be used to determine parameterized complex- 
ity of a number of other optimization problems on permutations parameterized 
above or below tight bounds, cf. [l9j |. 

Given a parameterized problem L, a kernelization of L is a polynomial-time 
algorithm that maps an instance (x, k) to an instance (x',k!) (the (problem) 
kernel) such that (i) (x, k) G L if and only if (x 1 , k') G L, (ii) k' < /(«), and 
(iii) \x'\ < g{n) for some functions / and g. The function g(n) is called the 
size of the kernel. A parameterized problem is fixed-parameter tractable if and 
only if it is decidable and admits a kernelization [13| : however, the problem 
kernels obtained by this general result have impractically large size. Therefore, 
one tries to develop kernelizations that yield problem kernels of smaller siz^]. 
A survey of Guo and Niedermeier (l6l | on kernelization lists some problem for 
which polynomial size kernels were obtained. However, polynomial size kernels 
are known only for some fixed-parameter tractable problems and Bodlaender 
et al. 0] proved that many fixed-parameter tractable problems do not have 
polynomial size kernels under reasonable complexity-theoretic assumptions. 

2. Strictly Above/Below Expectation Method 

The Strictly Above/Below Expectation method was recently introduced in 
(l8j to prove fixed-parameter tractability of maximization (minimization, re- 
spectively) problems II parameterized above (below, respectively) tight lower 
(upper, respectively) bounds. In that method, we first apply some reductions 
rules to reduce the given problem II to its special case II'. Then we introduce a 
random variable X such that the answer to II' is Yes if and only if X takes with 
positive probability a value greater or equal to the parameter k. If X happens to 
be a symmetric random variable then the simple inequality P(X > ^/E[X 2 ]) > 



1 Problem kernels are of great practical importance when it comes to solving NP-hard 
problems; they can be interpreted as polynomial-time preprocessing with a quality guaran- 
tee mEa. 
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can be useful; here P(-) and E[-] denote probability and expectation, respectively. 
An application is the Linear Ordering problem and a special case of Max 
Lin-2 parameterized above tight lower bounds [l8j]. However, often X is not 
symmetric. In many such cases the following lemma by Alon, Gutin and Kriv- 
elevich is of use; a weaker version of this lemma was obtained by Hastad and 
Venkatesh 0. 

Lemma 2.1. Let X be a real random variable and suppose that its first, second 
and forth moments satisfy E[X] = 0, E[A 2 ] = a 2 > and E[X 4 } < ba A , 
respectively. Then P(X > > 44 / 3f) ■ 

Often this result can be combined with the next lemma, which is an extension 
of Khinchin's Inequality by Bourgain Q . 

Lemma 2.2. Let f = f(x%, . . . ,x n ) be a polynomial of degree r in n variables 
Xi, . . . , x n with domain { — 1,1}. Define a random variable X by choosing a vec- 
tor (ei, . . . , e„) G { — 1, 1}" uniformly at random and setting X — f{e\, . . . , e„). 
Then, E(A 4 ) < 2 6r (E(A 2 )) 2 . 

These two lemmas were used in 0] and [l|| to prove fixed-parameter tractabil- 
ity of Max r-SAT and a special case of Max Lin-2 parameterized above tight 
lower bounds. Unfortunately, it appears to be impossible to introduce a random 
variable X for which P(X > k) > if and only if the answer to BATLB is Yes 
and such that X is either symmetric or satisfies the conditions of Lemma 12.21 
Thus, in the next section, we introduce X for which we have a weaker property 
with respect to BATLB: if F(X > k) > then the answer to BATLB is Yes. 
This X, however, satisfies the conditions of Lemma 12.21 

To apply Lemma [2TTT we need to evaluate E[A 2 ]. While such evaluations in 
[2] and [li| are rather straightforward, our evaluation of E[A 2 ] is quite involved 
and requires assistance of a computer. Note that we cannot algebraically sim- 
plify X as was done in Q in order to simplify computation of E[A 2 ] . This is due 
to the weaker property of X with respect to BATLB: for some YES-instances 
of BATLB we may have P(X > k) = 0. 

3. A quadratic kernel for BATLB 

We will now show fixed-parameter tractability of BATLB. In fact, we will 
prove a stronger statement, i.e., that this problem has a kernel of quadratic size. 

For a constraint C of C let vars{C) denote the set of variables in C. We 
call a triple A, B, C of distinct betweenness constraints complete if vars(A) = 
vars(B) = vars(C). 

Consider the following reduction rule: if C contains a complete triple of 
constraints, delete these constraints from C and delete from V any variable 
that appears only in the triple. Since for every linear arrangement exactly one 
constraint in each complete triple is satisfied we have the following: 
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Lemma 3.1. Let (V,C) be an instance of BATLB and let (V',C) be obtained 
from ( V, C) by applying the reduction rule as long as possible. Then ( V, C) is a 
YES-instance of BATLB if and only if so is (V',C). 

An instance ( V, C) of BATLB is irreducible if it does not contain a complete 
triple. Observe that using Lemma 13.11 we can transform any instance into an 
irreducible one and it will take no more than 0(m 3 ) time. 

Consider an instance (V, C), for a set V of variables and a set C — {C\, . . . , C m } 
of betweenness constraints, and a random function 4> : V — > {0, 1, 2, 3}. (The 
reason we consider a random function 4> : V — > {0, 1, 2, 3} rather than a ran- 
dom function <fi : V — ► {0, 1} is given in the end of this section.) Let (i(4>) be 
the number of variables in V mapped by <f) to i for i = 0, 1, 2, 3. Now obtain a 
bijection a : V —> {1, . . . , \V\} by randomly assigning values 1, . . . , lo(4>) to all 
a(v) for which <j){v) = 0, and values J2iZo ^i{ < f) + !)•••! Si=o *° au 
for which 4>(v) = j for every j — 1, 2, 3. We call such a linear arrangement a a 
<f>- compatible bijection. It is easy to see that a obtained in this two stage process 
is, in fact, a random linear arrangement, but this fact is not going to be used 
here. 

Now assume that a function <j> : V —> {0,1, 2, 3} is fixed and consider a 
constraint C p = (v,, i>fc}) G C. Let a be a random ^-compatible bijection 
and Vp(u) = 1 if C„ is satisfied and 0, otherwise. Let the weights w(C p ,4>) — 
E(i/ p (cO) - 1/3 and w{C, 0) = £™=i w(C p , 0). 

Lemma 3.2. If w{C,4>) > k then (V,C) * s a YES-instance o/BATLB. 

Proof. By linearity of expectation, w(C,4>) > k implies E(2^L X v p (a)) > 
m/i + k. Thus, if w(C, <f>) > k then there is a ^-compatible bijection a that 
satisfies at least to/3 + k constraints. □ 

Let X = w(C, 4>) and X p = w(C p , </>), p = 1, . . . , m. Observe that if is a 
random function from V to {0, 1, 2, 3} then X, Xi, . . . , X m are random variables. 
Recall that X — Y^p=i X p . 

Lemma 3.3. We have E[X] = 0. 

Proof. Let C p — (Vi, {vj,Vk}) G C. Let us first find the distribution of X p . 
It is easy to check that the probability that <f>(vi) — <j)(vj) — (f>(vk) equals 1/16 
and X p = in such a case. The probability that </>(i>i) ^ (/>(vj) — <t>{vk) equals 
3/16 and X p = —1/3 in such a case. The probability that <f>(vi) equals one of 
the non-equal (f>(vj), 4>(vk) is equal to 6/16 and X p = 1/6 in such a case. Now 
suppose that (f>{vi) , <j){vj) and <j>(vk) are all distinct. The probability that (j>(vi) 
is between 4>( v j) an( i is 2/16 and X p = 2/3 in such a case. Finally, the 

probability that <f>{vi) is not between 4>{Vj) and (j)(vk) is 4/16 and X p = —1/3 
in such a case. Now we can give the distribution of X p in the following table. 

Using this distribution, it is easy to see that E[X p ] = and, thus, K[X] = 



6 



\{</>(Vi), <f>(vj), <j>(v k )}\ 


Relation 


Value of X p 


Prob. 


1 


4>(v t ) = (f)(vj) = (j)(v k ) 





1/16 


2 


<i>{Vi) ^ (f)(Vj) = (j)(v k ) 


-1/3 


3/16 


2 


(t>{Vi) € {(f>(vj) 7 (j)(v k )} 


1/6 


6/16 


3 


4>{vi) is between <j)(vj) and <p(vk) 


2/3 


2/16 


3 


4>{vi) is not between 4>{ v j) an d 0(«fc) 


-1/3 


4/16 



Table 1: Distribution of X p . 



Lemma 3.4. The random variable X can be expressed as a polynomial of degree 
6 in independent uniformly distributed random variables with values —1 and 1. 

PROOF. Consider C p = £ C. Let e\ = -1 if <f)(vi) = or 1 and 

e[ = 1, otherwise. Let e| = — 1 if <p{i'i) = or 2 and e| = 1, otherwise. Similarly, 
we can define e^, ef, e^, e§. Now e^e^ can be seen as a binary representation of 
a number from the set {0, 1,2,3} and e^e^e^e^ele^ can be viewed as a binary 
representation of a number from the set {0, 1, . . . , 63}, where —1 plays the role 
of 0. 

We can write X p as the following polynomial: 

63 

± $>ir'^ • (el + ci 9 )(4 + 4*)(ei + cf )(4 + c")(e* + c*«)(e* + c 2 fe «), 

9=0 

where c^c^c^'c^c^c^ 9 is the binary representation of q, s q is the number of 
digits equal —1 in this representation, and w q equals the value of X p for the 
case when the binary representations of 4>(vi), 4>{vj) and (j)(vk) are c^cJ, 9 , c{ 9 c^ 9 
and c^Cg 9 , respectively. The actual values for X p for each case are given in the 
proof Lemma 13.31 The above polynomial is of degree 6. It remains to recall 
that X = J2 P=1 X P . □ 

Lemma 3.5. For an irreducible instance (V,C) of BATLB we have E[X 2 } > 

768'"- 

Proof. First, observe that E[X 2 ] = £!=i E [^ 2 ] + Ei<;^'< m H x i x i'\- We 
will compute E[JT ; 2 ] and E[X;.XV] separately. 

Using the distribution of Xi given in Table 1, it is easy to see that E[X 2 ] 
1 I 'Hi = 88/768. It remains to show that 

V ElXtXy] > - — m. (1) 
l<tyl'<m 

Indeed, © and E[Xf] = 88/768 imply that 

ELY 2 1 = V ELY 2 1 + V E[XiX v ] > —m - — m = —m, 
1 1 ^ 1 11 ^ 1 1 ~ 768 768 768 

1=1 l<//i'<m 
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In the reminder of this proof we show that fl} holds. Let Ci, Cy be a pair 
of distinct constraints of C. To evaluate E[Xj.XV], we consider several cases. A 
simple case is when the sets vars(C{) and vars(Cii) are disjoint: then Xi and 
Xii are independent random variables and, thus, E[X;X;/] = E[JQ]E[JQ/] = 0. 
Let U = {(1,1') | Ct,Ci> G C,l ^ I'} be the set of all ordered index pairs 
corresponding to distinct constraints in C. We will classify subcases of this case 
by considering some subsets of U. Let 



Si(u) = {(/,/' 



S 2 (u) 
S 3 (u) 
Si(u,v) 

S 5 (u,v) 
S 6 (u,v) 

S 7 (u,v) 
S$(u,v,w) 



= {(1,1' 

= {(U' 

= {(IJ' 

U {(1,1' 

= {Q,l' 

= {(IJ' 

u {(/,/' 

= {(1,1' 

= {(1,1' 



G U : Ci = (u,{a,b}),C v = (u,{c,d}), a,b,c,d<EV}, 

G U : Ci = (a, {«,&}), CV = (c, {u,d}), a,b,c,deV}, 

, (I', l)eU : Ci = (u, {a, b}),C v = (c, {u, d}), a, b, c, d G V}, 

G U : Ci = (u, {v, a}), C v = (u, {v, b}) 7 a,beV} 

G U : Q = (v, {u, a}), C v = (v, {u, &}), a, b G V}, 

G U : Ci = (a, {u, »}), Cj/ = (6, {«, «}), a, 6 G V}, 

, (I', l)eU : Q= (u, {v, a}), C v = (b, {«, «}), a, 6 G 

, (J', € Z7 : Cj = 0, {«, a}), Q, = (6, {«, «}), a, 6 G 1/}, 

, (J', Oct/ : Ci= (u, {«, a}), = («, {«, 6}), a, 6 G 

G C/ : vars(Ci) — vars(Ci>) — {u, v, w}}. 



Let u, v G be a pair of distinct variables. Observe that S±(u, v) = (S\ (u) D 
S 2 (i;))U(Si(v)n &(«)), 5 5 («, v) = S 2 (u) n 5 2 (u), SeM) = (5 3 ( U )nS 2 W)U 
(S 3 (v) n S 2 (uj) and S-?(u,v) = S 3 (u) n ^(w). Let u,v,w G V be a triple of 
distinct variables. Observe that 



S 8 (u,v,u>) = (53(u)nS 3 (w)nS 2 (i«))u(53(w)nS3(«;)nS2(u))u(S3(«')nS3(u)n52(«)). 

(2) 

For a variable u € V, let 6(it) = \{l : Ci = (u,{a,b}),a,b G V}| and 
e(u) = \{l : Ci = (a,{u,b}),a,b G V}\. Observe that \Si(u)\ = b(u)(b(u) - 1), 
|5 2 (u)| = e(u)(e(u) - 1) and \S 3 (u)\ = 2b(u)e(u). 

For a pair u,v G V, let c" = |{Z : C; = (u, {v,a}),a G V^} and c uv — 
\{l : Ci = (o, {«,»}), a G V}|. Observe that |5 4 (u,w)| = cg(c* - 1) + c£« - 1), 
|S5(u, u)j = c ul ,(c TO - 1), \S 6 {u,v)\ = 2(c% + cl) -c uv and \S 7 {u,v)\ = 2c£c£. Let 
»,s,weFbea triple of distinct variables. Since C is irreducible, the number 
of ordered pairs (C\, Ci>) for which vars(C'i) — vars(Ci>) = {u, v, w} is at most 
2, i.e., \S 8 (u,v,w)\ < 2. 

We list the sets Si(-), their union/intersection forms (for i = 4,5,6,7) and 
their sizes in Table 2. If (I, I') belongs to some Si but to no Sj for j > i, then 
Table 2 also contains the value 768 • E[X;X;/], in the row corresponding to Si. 
These values cannot be easily calculated analytically as there are many cases to 
consider and we have calculated them using computer. We will briefly describe 
how our program computes EpQJQ/] using as an example the case (I, I') G Si(u), 
i.e., C t = (u,{a,b}),C v = (u,{c,d}). For each (gi, g 2 , 3s, 34, Ss) G {0, 1,2,3} 5 
the probability of (u, a, 6, c, d) = (qi, q 2 , q 3l 94, (75) is 4~ 5 and the corresponding 
value of XiXi' can be found in Table 2. 
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Set 


Union/intersection Form 


| Set | 


768E[X;X ( /] 


768u/ 








12 = wi 


12 


S 2 (u) 




e(u)(e(u) - 1) 


3 = w 2 


3 


Sa{u) 




b(u)e(u) + e{u)b{u) 


-6 = tU 3 


-6 


Sa(u,v) 


{S 1 {u)nS 2 (v))u{S 1 (v)nS 2 (u)) 


c*(c«- l)+<«-l) 


24 = w 4 


9 


S 5 (u,v) 


s 2 (u)ns 2 (v) 


CtiU (c uv 1 ) 


36 = w$ 


30 


S 6 (u,v) 


(s 3 (u)ns 2 (v))u(s 3 (v)ns 2 (u)) 




-18 = ui 6 


-15 


S 7 (u,v) 


s 3 (u)ns 3 (v) 




—6 = ?«7 


G 


S$(u,v,w) 


see © 


< 2 


-44 = w 8 


-11 



Table 2: Data for sets S l {u), i = 1, 2, . . . , 8. 

We are now ready to compute a lower bound on the term £i<;^;/< m 

E[XiX v }. 

Define the values w\ for i — 1, 2, . . . , 8 as it is done in Table 2. We will now 
show that the following holds (note that the sets we sum over have to contain 
distinct elements). 

Ei<i^'< m E [^^'] = E ue y ELi l&MK +£{ U) t,}cv El=4 |Sj(u,u)K 

+ E{u,i;,«}CV |5 8 (t*,»)m 
In order to show the above we consider the possible cases for (I, I') G U. 

Case 1: \vars(Ci) fl ?;ars(CV)| = 0. In this case E[X;X;/] = and the corre- 
sponding (1,1') does not belong to any Si and therefore contributes zero to the 
right-hand side above. 

Case 2: \vars(Ci) fl vars(Ci<)\ = 1. Each pair (1,1') 6 <Si(u) contributes ^ to 
both sides of the above equation, as in this case (1,1') does not belong to any Sj 
with j > 1. Analogously if (1,1') G S 2 (u) then it contributes to both sides 
of the above equation. Furthermore if (I, I') E S 3 (u) then it contributes — y|g. 

Case 3: \vars(Ci)nvars(Ci')\ = 2. Consider a pair (1,1') G 84(11, v) and assume, 
without loss of generality, that (I, I') G Si(u)<lS 2 (v). Note that (I, I') contributes 
to the left-hand side of the equation and it contributes w[ +w'2 + w' 4 = to 
the right-hand side (as (I, I') G Si(u) n S 2 (v) n S^u, v)). Analogously if (I, I') G 
Ss(u, v) we get a contribution of w 5 = ^ = + w' 2 + w' 5 to both sides of the 
equation. If (I, I') G Sq(u, v) we get a contribution of wq — = w' 3 + w' 2 + w' 6 

to both sides of the equation. If (1,1') G Sj(u,v) we get a contribution of 
wj — — y|g = w' 3 + w' 3 + w'j to both sides of the equation. 

Case 4: \vars(Ci) C\vars(Cv)\ — 3. Assume, without loss of generality, that 
(1,1') G S 3 (u)nS 3 (v)r\S 2 (w) and note that (1,1') G SV(u,v)nS 6 (u, w)nS 6 (v,-u;). 
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Therefore we get a contribution of Wg = —jgg = w^ + w^ + w^ + w^ + Wg + Wg+Wg 
to both sides of the equation. 

Therefore the above equation holds, which implies the following: 

Ei<j,«>< TO E[*i*i'] = Euev (\Si(u)\w[ + \S 2 (u)\w' 2 + \S 3 (u)\w' 3 ) 

+ E{u,v}CV Ei=4 V)H + E{u,v,w}CV \ S s( U i V > W )\ W S 

= 2T68 T.uev (6(26(«) - e{u)f - 2Ab(u) - 6e(u)) 

+ Th £{«,„}cv (i5« +<- 2c™) 2 + 12 (^) 2 - 18(c« + cl) - 60c™j 

To complete the proof of the lemma it remains to translate this sum into a 
function on the number of constraints. In that respect, notice that Yluev b( u ) = 
m and J2utv e ( u ) ~ 2m. Further, each clause (it, {v,u>}) contributes exactly 
one unit to each of c" and c^J,, as well as exactly one unit to c vw . Hence 
J2{ u ,v}cv( c v + c u) = 2m and J2{ u ,v}cv Cuv = m - Since C is irreducible, the 
number of ordered pairs (C;, CV) for which vars(Ci) = vars(Ci') is at most m/2 
and, thus, 

^2 \S s (u,v, w)\w 8 < m ■ w' s . 

{u,v,w}GV 

Together these bounds imply that 

M y y] > 36 96 11 77 

> mAAiJiv > m m m = m 

4^ " 2 ■ 768 2-768 768 768 

and (P) holds. □ 

We are now ready to prove the main result. 
Theorem 3.6. BATLB has a kernel of size 0(n 2 ). 

PROOF. Let (V,C) be an instance of BATLB. By Lemma [3TTI in time 0(m 3 ) 
we can obtain an irreducible instance (V' , C) such that (V, C) is a YES-instance 
if and only if (V',C) is a YES-instance. Let ml = \C'\ and let X be the random 
variable defined above. Then X is expressible as a polynomial of degree 6 by 
Lemma EU hence it follows from Lemma O that ELY 4 ] < 2 36 E[Y 2 ] 2 . Con- 
sequently, X satisfies the conditions of Lemma 12.11 from which we conclude in 

combination with Lemma \3. 51 that P (^X > j^m \J^ e \ m '^j > 0. By Lemma \3. 2 1 

if j^\p^gm' > K then (V',C) is a YES-instance for BATLB. Otherwise, we 
have ml = 0(k 2 ). This concludes the proof of the theorem. □ 
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We complete this section by answering the following natural question: why 
have we considered functions <j> : V — » {0, 1, 2, 3} rather than functions <f> : V — > 
{0,1}? The latter would involve less computations and give a smaller degree 
of the polynomial representing X. The reason is that our proof of Lemma 13.51 
would not work for functions <j> : V — * {0, 1} (we would only be able to prove 
that E[X 2 } > J2{ u .v}cvi c u + c v ~ 2c uv } 2 , which is not enough). 
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